Extracting fine-grained location with temporal awareness in tweets: A two-stage approach

نویسندگان

  • Chenliang Li
  • Aixin Sun
چکیده

Twitter has attracted billions of users for life logging and sharing activities and opinions. In their tweets, users often reveal their location information and short term visiting histories or plans. Capturing user’s short term activities could benefit many applications for providing the right context at the right time and location. In this paper, we are interested in extracting locations mentioned in tweets at fine-grained granularity, with temporal awareness. More specifically, we like to recognise the point-of-interests (POI) mentioned in a tweet and predict whether the user has visited, is currently at, or will soon visit the mentioned POIs. A POI can be a restaurant, a shopping mall, a bookstore or any other fine-grained location. Our proposed framework, named TS-Petar (Two-Stage POI Extractor with Temporal Awareness), consists of two main components: a POI inventory and a two-stage time-aware POI tagger. The POI inventory is built by exploiting the crowd wisdom of Foursquare community. It contains both POIs’ formal names and their informal abbreviations, commonly observed in Foursquare check-ins. The time-aware POI tagger, based on the Conditional Random Field (CRF) model, is devised to disambiguate the POI mentions and to resolve their associated temporal awareness accordingly. Three sets of contextual features (linguistic, temporal, and inventory features) and two labeling schema features (OP and BILOU schemas) are explored for the timeaware POI extraction task. Our empirical study shows that the subtask of POI disambiguation and the subtask of temporal awareness resolution call for different feature settings for best performance. We have also evaluated the proposed TS-Petar against several strong baseline methods. The experimental results demonstrate that the twostage approach achieves the best accuracy and outperforms all baseline methods in terms of both effectiveness and

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

When twitter meets foursquare: tweet location prediction using foursquare

The continued explosion of Twitter data has opened doors for many applications, such as location-based advertisement and entertainment using smartphones. Unfortunately, only about 0.58 percent of tweets are geo-tagged to date. To tackle the location sparseness problem, this paper presents a methodical approach to increasing the number of geotagged tweets by predicting the fine-grained location ...

متن کامل

A fine-grained sentiment analysis approach for detecting crisis related microposts

Real-time information from microposts like Twitter is useful for applications in the crisis management domain. Currently, that potentially valuable information remains mostly unused by the command staff, mainly because the sheer amount of information cannot be handled efficiently. Sentiment analysis has been shown as an effective tool to detect microposts (such as tweets) that contribute to sit...

متن کامل

Joint Recognition and Linking of Fine-Grained Locations from Tweets

Many users casually reveal their locations such as restaurants, landmarks, and shops in their tweets. Recognizing such fine-grained locations from tweets and then linking the location mentions to well-defined location profiles (e.g., with formal name, detailed address, and geo-coordinates etc.) offer a tremendous opportunity for many applications. Different from existing solutions which perform...

متن کامل

Extracting fine-grained durations for verbs from Twitter

This paper presents recent work on a new method to automatically extract finegrained duration information for common verbs using a large corpus of Twitter tweets. Regular expressions were used to extract verbs and durations from each tweet in a corpus of more than 14 million tweets with 90.38% precision covering 486 verb lemmas. Descriptive statistics for each verb lemma were found as well as t...

متن کامل

EmoTwitter - A Fine-Grained Visualization System for Identifying Enduring Sentiments in Tweets

Traditionally, work on sentiment analysis focuses on detecting the positive and negative attributes of sentiments. To broaden the scope, we introduce the concept of enduring sentiments based on psychological descriptions of sentiments as enduring emotional dispositions that have formed over time. To aid us identify the enduring sentiments, we present a fine-grained functional visualization syst...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • JASIST

دوره 68  شماره 

صفحات  -

تاریخ انتشار 2017